# Fine-tuning Optimization
Mask2former Finetuned ER Mito LD5
Other
An image segmentation model fine-tuned on the Dnq2025/Mask2former_Pretrain dataset, based on the facebook/mask2former-swin-base-IN21k-ade-semantic model
Image Segmentation
Transformers

M
Dnq2025
26
0
Lightblue Reranker 0.5 Cont Filt Gguf
A text ranking model fine-tuned based on Qwen2.5-0.5B-Instruct, suitable for information retrieval and relevance ranking tasks
Large Language Model
L
RichardErkhov
2,130
0
Belle Whisper Large V3 Turbo Zh
Apache-2.0
A Chinese speech recognition model fine-tuned based on whisper-large-v3-turbo, showing significant performance improvements in multiple Chinese speech recognition benchmarks
Speech Recognition
Transformers

B
BELLE-2
2,891
55
T5 Small Finetuned V2 Hausa To Chinese
Apache-2.0
A Hausa-to-Chinese translation model fine-tuned based on T5-small, achieving a BLEU score of 30.0183 on the evaluation set.
Machine Translation
Transformers

T
Kumshe
15
1
Vit GPT2 Image Captioning Model
An image caption generation model based on the ViT-GPT2 architecture, capable of converting input images into descriptive text
Image-to-Text
Transformers

V
motheecreator
142
0
Learn Hf Food Not Food Text Classifier Distilbert Base Uncased
Apache-2.0
This is a text classification model fine-tuned on DistilBERT-base-uncased for distinguishing between food and non-food text content.
Text Classification
Transformers

L
mrdbourke
238
2
Xfinder Llama38it
xFinder-llama38it is a fine-tuned key answer extraction model based on Llama3-8B-Instruct, designed to improve the accuracy and robustness of key answer extraction from large language model outputs.
Large Language Model
Transformers English

X
IAAR-Shanghai
189
5
Deepfake Audio Detection
Apache-2.0
A speech processing model further fine-tuned based on wav2vec2-base-finetuned, achieving 98.82% accuracy on the evaluation set
Speech Recognition
Transformers

D
motheecreator
1,468
7
Deepfake Audio Detection
Apache-2.0
A fine-tuned speech processing model based on wav2vec2-base-finetuned, achieving 98.82% accuracy on the evaluation set
Speech Recognition
Transformers

D
mo-thecreator
801
7
Paligemma 3b Pt 448
PaliGemma is a lightweight and versatile vision-language model built on the SigLIP vision model and Gemma language model, supporting multilingual image-text interaction tasks.
Image-to-Text
Transformers

P
google
2,708
29
Tinyllama Essay Scorer
Apache-2.0
A fine-tuned essay scoring model based on TinyLlama-1.1B
Large Language Model
Transformers

T
as-cle-bert
19
2
T5 Small Finetuned Nl2sql
Apache-2.0
A T5-small fine-tuned NL2SQL model for converting natural language to SQL queries
Large Language Model
Transformers

T
Shritama
27
1
Belle Distilwhisper Large V2 Zh
Apache-2.0
A Chinese speech recognition model fine-tuned based on distilwhisper-large-v2, with a speed 5.8 times faster than whisper-large-v2 and 51% fewer parameters
Speech Recognition
Transformers

B
BELLE-2
230
37
Layout Qa Hparam Tuning
A document QA model fine-tuned based on microsoft/layoutlmv2-base-uncased, suitable for document layout understanding and QA tasks
Question Answering System
Transformers

L
PrimWong
14
0
Whisper Small Turkish Tr Best
Apache-2.0
Turkish speech recognition model fine-tuned based on OpenAI Whisper-small, with a word error rate of 26.34%
Speech Recognition
Transformers

W
erenfazlioglu
61
4
Git Base Next
MIT
Fine-tuned image-to-text model based on microsoft/git-base
Image-to-Text
Transformers Other

G
swaroopajit
19
1
Distilhubert Finetuned Gtzan
Apache-2.0
This model is a fine-tuned version of DistilHuBERT on the GTZAN music classification dataset, primarily used for music genre classification tasks.
Audio Classification
Transformers

D
pollner
24
0
Opus Mt Ko En Finetuned
Apache-2.0
A Korean-English translation model fine-tuned based on Helsinki-NLP's opus-mt-ko-en model
Machine Translation
Transformers

O
yeeunlee
21
1
Saved Model Git Base
MIT
A vision-language model fine-tuned on image folder datasets based on microsoft/git-base, primarily used for image caption generation tasks
Image-to-Text
Transformers Other

S
holipori
13
0
Segformer B0 Finetuned Segments Test
Other
An image segmentation model fine-tuned on the bilal01/stamp-verification-test dataset based on nvidia/mit-b0
Image Segmentation
Transformers

S
bilal01
15
0
Swin Tiny Patch4 Window7 224 Isl Finetuned
Apache-2.0
A vision model fine-tuned based on microsoft/swin-tiny-patch4-window7-224, achieving 100% accuracy on the evaluation set
Image Classification
Transformers

S
hazardous
17
0
Detr Resnet 50 Finetuned Cppe5
Object detection model fine-tuned on the cppe-5 dataset based on facebook/detr-resnet-50
Object Detection
Transformers

D
Mustafa21
15
0
Videomae Base Finetuned
A video understanding model fine-tuned on an unknown dataset based on the VideoMAE base model, achieving 86.41% accuracy on the evaluation set
Video Processing
Transformers

V
LouisDT
15
0
Xtremedistil L6 H384 Uncased Finetuned Squad
MIT
This model is a fine-tuned version of microsoft/xtremedistil-l6-h384-uncased on the SQuAD dataset, primarily used for question answering tasks.
Question Answering System
Transformers

X
tachyon-11
20
0
Swin Base Finetuned Cifar100
Apache-2.0
This model is an image classification model fine-tuned on the CIFAR-100 dataset based on the Swin Transformer architecture, achieving an accuracy of 92.01%.
Image Classification
Transformers

S
MazenAmria
119
1
Whisper Medium Jp
Apache-2.0
Japanese speech recognition model fine-tuned on the common_voice_11_0 dataset based on openai/whisper-medium
Speech Recognition
Transformers Japanese

W
vumichien
4,542
25
Xlm Roberta Base Finetuned Urdu
Urdu sentiment classification model based on xlm-roberta-base architecture, capable of binary sentiment classification for Urdu sentences
Text Classification
Transformers Other

X
Aimlab
129
3
Segformer B0 Finetuned Segments Water 2
Apache-2.0
An image segmentation model fine-tuned on the imadd/water_dataset dataset based on nvidia/mit-b0, designed for water segmentation tasks
Image Segmentation
Transformers

S
imadd
51
1
Resnet 50 Ucsat
Apache-2.0
An image classification model fine-tuned based on microsoft/resnet-50, demonstrating medium accuracy on an unknown dataset
Image Classification
Transformers

R
YKXBCi
24
0
Distilbert Base Uncased Becas 4
Apache-2.0
A text classification model fine-tuned on the becasv2 dataset based on distilbert-base-uncased
Large Language Model
Transformers

D
Evelyn18
20
0
Distilbert Base Uncased Finetuned Squad
Apache-2.0
A question-answering model based on DistilBERT, fine-tuned on the SQuAD dataset for extractive question answering tasks.
Question Answering System
Transformers

D
lingchensanwen
16
0
Deberta Base Finetuned Aqa
MIT
A QA model fine-tuned on the adversarial_qa dataset based on microsoft/deberta-base
Question Answering System
Transformers

D
stevemobs
15
0
Convnext Tiny Finetuned Beans
Apache-2.0
This model is an image classification model fine-tuned on the beans dataset based on the ConvNeXt-Tiny architecture, achieving an accuracy of 96.09%.
Image Classification
Transformers

C
mrm8488
15
1
Xtremedistil L12 H384 Uncased Finetuned Wikitext103
MIT
This model is a fine-tuned version of microsoft/xtremedistil-l12-h384-uncased on the wikitext dataset, primarily used for text generation tasks.
Large Language Model
Transformers

X
saghar
16
1
Wav2vec2 Base Cv
Apache-2.0
A speech recognition model fine-tuned on the common_voice dataset based on facebook/wav2vec2-base
Speech Recognition
Transformers

W
jiobiala24
24
0
Wav2vec2 Large Xls R 300m Pt Colab
Apache-2.0
A speech recognition model fine-tuned on the common_voice dataset based on facebook/wav2vec2-xls-r-300m
Speech Recognition
Transformers

W
tonyalves
17
0
Distilhubert Ft Common Language
Apache-2.0
This model is a fine-tuned audio classification model based on distilhubert trained on a common language dataset, primarily used for language recognition tasks.
Audio Classification
Transformers

D
anton-l
17
2
Distilroberta Base Model Transcript
Apache-2.0
A text processing model fine-tuned based on the distilroberta-base model, suitable for general NLP tasks
Large Language Model
Transformers

D
mahaamami
14
0
Sagemaker Distilbert Emotion
Apache-2.0
A text sentiment classification model based on DistilBERT, fine-tuned on the emotion dataset with an accuracy of 92.9%
Text Classification
Transformers

S
jpabbuehl
21
0
NER RUBERT Per Loc Org
A lightweight Russian named entity recognition model based on BERT architecture, supporting the identification of three types of entities: person, location, and organization.
Sequence Labeling
Transformers

N
tesemnikov-av
15
0
- 1
- 2
Featured Recommended AI Models